Small Maximal Independent Sets and Faster Exact Graph Coloring 

David Eppstein* 



Abstract 

We show that, for any «-vertex graph G and integer parameter k, there are at most 34A:-n4«-3i jj^^x- 
imal independent sets I C G with |/| < k, and that all such sets can be Hsted in time e)(3'*'=-"4"-3*^). 
These bounds are tight when n/4 < k < n/3. As a consequence, we show how to compute the 
exact chromatic number of a graph in time 0{{4/3 + 3'*/^/4)") « 2.4150", improving a previous 
©((I + 3'/^)") « 2.4422" algorithm of Lawler (1976). 

1 Introduction 

One of the earliest works in the area of worst-case analysis of NP-hard problems is a 1976 paper by 
Lawler ^ on graph coloring. It contains two results: an algorithm for finding a 3-coloring of a graph 
(if the graph is 3-chromatic) in time 0{3"^^) « 1.4422", and an algorithm for finding the chromatic number 
of an arbitrary graph in time 0{{l + 3^/^)") « 2.4422". Since then, the area has grown, and there has been 
a sequence of papers improving Lawler's 3-coloring algorithm [|l],^^^, with the most recent algorithm 
taking time « 1.3289". However, there has been no improvement to Lawler's chromatic number algorithm. 

Lawler's algorithm follows a simple dynamic programming approach, in which we compute the chro- 
matic number not just of G but of all its induced subgraphs. For each subgraph S, the chromatic number is 
found by listing all maximal independent subsets I C S, adding one to the chromatic number of S \I, and 
taking the minimum of these values. The 0{{\ + 3^/'')") running time of this technique follows from an 
upper bound of 3"/^ on the number of maximal independent sets in any «-vertex graph, due to Moon and 
Moser [^. This bound is tight in graphs formed by a disjoint union of triangles. 

In this paper, we provide the first improvement to Lawler's algorithm, using the following ideas. First, 
instead of removing a maximal independent set from each induced subgraph S, and computing the chromatic 
number of S from that of the resulting subset, we add a maximal independent set of G\S and compute the 
chromatic number of the resulting superset from that of S. This reversal does not itself affect the running time 
of the dynamic programming algorithm, but it allows us to constrain the size of the maximal independent 
sets we consider to at most \S\/3. We show that, with such a constraint, we can improve the Moon-Moser 
bound: for any «-vertex graph G and integer parameter k, there are at most 34^-«4"-3^ maximal independent 
sets I C G with |/| < k. This bound then leads to a corresponding improvement in the running time of our 
chromatic number algorithm. 



2 Preliminaries 

We assume as given a graph G with vertex set V{G) and edge set E{G). We let n = \V{G) \ and m = \E{G)\. 
A proper coloring of G is an assignment of colors to vertices such that no two endpoints of any edge share the 
same color. We denote the chromatic number of G (the minimum number of colors in any proper coloring) 

by x{G). 
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If V{G) = {vq, vi, . . . v„_i}, then we can place subsets 5 C V{G) in one-to-one correspondence with 
theintegersO, 1,...2" - 1: 

Vies 

Subsets of vertices also correspond to induced subgraphs of G, in which we include all edges between 
vertices in the subset. We make no distinction between these three equivalent views of a vertex subset, so 
e.g. we will write x(S) to indicate the chromatic number of the subgraph induced by set S, and X[S] to 
indicate a reference to an array element indexed by the number J2vies'^'- write S < T to indicate the 
usual arithmetic comparison between two numbers, and 5 C T to indicate the usual (proper) subset relation 
between two sets. Note that, if S C T, then also S < T, although the reverse implication does not hold. 

A set 5 is a maximal k-chromatic subset of T if 5 C r, xi^) = k, and x{^') > ^ for every S <Z S' C.T. 
In particular, ifk= 1, 5 is a maximal independent subset of T. 

For any vertex v G we let A'^(v) denote the set of neighbors of v, including v itself. If 5 and T are 

sets, S\T denotes the set-theoretic difference, consisting of elements of S that are not also in T. Kj denotes 
the complete graph on i vertices. We write deg(v, S) to denote the degree of vertex v in the subgraph induced 
by S. 

We express our pseudocode in a syntax similar to that of C, C++, or Java. In particular this implies that 
array indexing is zero-based. We assume the usual RAM model of computation, in which a single cell is 
capable of storing an integer large enough to index the memory requirements of the program (thus, in our 
case, n-bit values are machine integers), and in which arithmetic and array indexing operations on these 
values are assumed to take constant time. 



3 Small Maximal Independent Sets 

Theorem 1 Let G be an n-vertex graph, and k be a nonnegative number. Then the number of maximal 
independent sets I C V{G) for which \I\ < k is at most 2^k-n^n-3k 

Proof: We use induction on n; in the base case n = 0, there is one (empty) maximal independent set, and 
for any k>0,l< 3'^^4-3^ = (81/64)^ Otherwise, we divide into cases according to the degrees of the 
vertices in G, as follows: 

• If G contains a vertex v of degree three or more, then each maximal independent set / either contains 
V (in which case / \ {v} is a maximal independent set of G \ A'^(v)) or it does not contain v (in which 
case / itself is a maximal independent set of G \ {v}). Thus, by induction, the number of maximal 
independent sets of cardinality at most k is at most 

2^4k-{n-l)^{n-[)-3k _|_ 2,4{k-l)-{n-4) ^{n-4)-3{k-l) = (1 _|_ }_\T^4k-n^n-3k ^ ■T^4k-n^n-3k 

as was to be proved. 

• If G contains a degree-one vertex v, let its neighbor be u. Then each maximal independent set contains 
exactly one of u or v, and removing this vertex from the set produces a maximal independent set of 
either G \ N{u) or G \ A^(v). If the degree of u is d, this gives us by induction a bound of 

34(;t-l)-(n-2)4(n-2)-3(*-l) _^ 2'^{k-l)-{n-d-l) ^{n-d-l)-3{k~l) < ^24k-n^n-3k 

9 

on the number of maximal independent sets of cardinaUty at most k. 
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• If G contains an isolated vertex v, then each maximal independent set contains v, and the number of 
maximal independent sets of cardinality at most k is at most 




• If G contains a chain u-v-w-x of degree two vertices, then each maximal independent set contains u, 
contains v, or does not contain u and contains w. Thus in this case the number of maximal independent 
sets of cardinality at most k is at most 



• In the remaining case, G consists of a disjoint union of triangles, all maximal independent sets have 
exactly «/3 vertices, and there are exactly 3"/^ maximal independent sets. \f k > n/3, then 3"/^ < 
T^Ak-n^n-^k _ If < n/3, there are no maximal independent sets of cardinality at most k. 



Croitoru [^] proved a similar bound with the stronger assumption that all maximal independent sets have 
|/| < k. When n/4 < k < n/3, our result is tight, as can be seen for a graph formed by the disjoint union of 
Ak — n triangles and n — 3k A'4's. 

Theorem 2 There is an algorithm for listing all maximal independent sets smaller than k in an n-vertex 
graph G, in time C)(34^-M"-3^). 

Proof: We use a recursive backtracking search, following the case analysis of Theorem |l]: if there is a 
high-degree vertex, we try including it or not including it; if there is a degree-one vertex, we try including 
it or its neighbor; if there is a degree-zero vertex, we include it; and if all vertices form chains of degree- 
two vertices, we test whether the parameter k allows any small maximal independent sets, and if so we 
try including each of a chain of three adjacent vertices. The same case analysis shows that this algorithm 
performs (9(34^-«4"-3^^ recursive calls. 

Each recursive call can easily be implemented in time polynomial in the size of the graph passed to the 
recursive call. Since our 34^-«4«-3/: bound is exponential in n, even when k = Q, this polynomial overhead 
at the higher levels of the recursion is swamped by the time spent at lower levels of the recursion, and does 
not appear in our overall time bound. □ 

A more detailed pseudocode description of the algorithm is shown in Figure |l|. The given pseudocode 
may generate non-maximal as well as maximal independent sets, because (when we try not including a high 
degree vertex) we do not make sure that a neighbor is later included. This will not cause problems for 
our chromatic number algorithm, but if only maximal independent sets are desired one can easily test the 
generated sets and eliminate the non-maximal ones. The pseudocode also omits the data structures necessary 
to implement each recursive call in time polynomial in \S\ instead of polynomial in the number of vertices 
of the original graph. 

4 Chromatic Number 

We are now ready to describe our algorithm for computing the chromatic number of graph G. We use an 
array X, indexed by the 2" subsets of G, which will (eventually) hold the chromatic numbers of certain of 
the subsets including V{G) itself. We initialize this array by testing, for each subset S, whether x{S) < 3; if 
so, we set X[S] to x{S), but otherwise we set X[S] to cxd. 



2 . 34(/t-l)-(;j-3)4(;j-3)-3(/t-l) ^ ^4{k-l)-{n-4) ^{n-4)-3{k-l) ^ }_± Tf'^k-n^n-^k 




Thus in all cases the number of maximal independent sets is within the claimed bound. 



□ 



3 



// List maximal independent subsets of S smaller than a given parameter. 
// 5 is a set of vertices forming an induced subgraph in G, 
II / is a set of vertices to be included in the MIS (initially zero), and 
// k bounds the number of vertices of S to add to /. 

// We call processMlS (/) on each generated set. Some non-maximal sets may be 
// generated along with the maximal ones, but all generated sets are independent. 

void smallMIS (set S, set /, int k) 
{ 

if (5 = or A: = 0) processMIS(I); 

else if (there exists v G 5 with deg(v, S) > 3) 

{ 

smallMIS (5 \ {v}, /, k); 
smallMIS (5 \ A^(v), / U {v}, k - 1); 

} 

else if (there exists v G 5 with deg(v, 5) = 1) 
{ 

let u be the neighbor of v; 
smallMIS (S \ N{u), I U {«}, k - 1); 
smallMIS (S \ N{v), I U {v}, k - 1); 

} 

else if (there exists v G 5 with deg(v, 5) = 0) 

smallMIS (S \ {v}, / U {v}, k - 1); 
else if (some cycle in S is not a triangle or A: > |5|/3) 
{ 

let u, V, and w be adjacent degree-two vertices, such that (if possible) u and w are nonadjacent; 
smallMIS (S \ N{u), I U {«}, k - 1); 
smallMIS {S \ N{v), I U {v}, k - 1); 
smallMIS (S \ {{u} U A^(w)), / U {w}, k - 1); 

} 

} 



Figure 1: Algorithm for listing all small maximal independent sets. 
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int chromaticNumber (graph G) 
{ 

mtX[2"]; 

for {S = 0;S< 2"; S++) 
{ 

if{x{s)<3)x[s]=x[s]; 

else X[S] = oo; 

} 

for {S = 0;S <2";S++) 
{ 

if (3 < X[S] < oo) 
{ 

for (each maximal independent set / of G \ 5 with |7| < — --) 

X[S\ 

X[SUI] = min(X[5U/],X[5] + 1); 

} 

} 

return X[V(G)]; 

} 



Figure 2: Algorithm for computing the chromatic number of a graph. 

Next, we loop through the subsets S of V{G), in numerical order (or any other order such that all proper 
subsets of each set S are visited before we visit 5 itself). When we visit S, we first test whether X[S] > 3. 
If not, we skip over S without doing anything. But if X[S] > 3, we loop through the small independent sets 
of G\S, limiting the size of each such set to using the algorithm of the previous section. For each 

independent set /, we set X[S U I] to the minimum of its previous value and X[S] + 1. 

Finally, after looping through all subsets, we return the value in X[V(G)] as the chromatic number of G. 
Pseudocode for this algorithm is shown in Figure |2[ 

Lemma 1 Throughout the course of the algorithm, for any set S, X[S] > xi'^)- 

Proof: Clearly this is true of the initial values of X. Then for any S and any independent set /, we can color 
5 U / by using a coloring of S and another color for each vertex in I, so x{S U I) < x{S) + f < X[5] + 1, 
and each step of our algorithm preserves the invariant. □ 

Lemma 2 Let M be a maximal k + \-chromatic subset of G, and let {S,I) be a partition of M into a k- 
chromatic subset S and an independent subset I, maximizing the cardinality ofS among all such partitions. 
Then I is a maximal independent subset of G \ S with \I\ < \S\/k, and S is a maximal k-chromatic subset of 
G. 

Proof: If we have any {k + l)-coloring of G, then the partition formed by separating the largest k color 
classes from the smallest color class satisfies the inequality |/| < \S\/k, so clearly this also is true when 
(S,/) is the partition maximizing \S\. If I were not maximal, due to the existence of another independent set 
I C I' C G\S, then SU/' would be a larger {k+ 1) -chromatic graph, violating the assumption of maximality 
ofM. 

Similarly, suppose there were another ^-chromatic set S C S' C G. Then if 5" n / were empty, 5" U / 
would be a + l)-chromatic superset of M, violating the assumption of M's maximality. But if 5' fl / were 
nonempty, {S',I \ S') would be a better partition than {S, I), so in either case we get a contradiction. □ 
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void color (graph G) 

{ 

compute array X as in Figure 

S = V{G); 

for (r = 2" - 1; r > 0; T ) 

{ 

if (r C 5 and X[S \T] = I and X[T] = X[S] - 1) 
{ 

color all vertices mS\T with the same new color; 
S = T; 

} 

} 

} 

Figure 3: Algorithm for optimally coloring a graph. 

Lemma 3 Let M be a maximal k + \-chromatic subset of G. Then, when the outer loop of our algorithm 
reaches M, it will be the case that X[M] = xi^)- 

Proof: Clearly, the initialization phase of the algorithm causes this to be true when x(^) < 3. Otherwise, 
let {S,I) be as in Lemma ^ By induction on \M\, X[S] = xi^) at the time we visit S. Then X[S] > 3, and 
|/| < \S\/X[S], so the inner loop for S will visit / and setZ[M] to X[S] + 1 = x{M). □ 

Theorem 3 We can compute the chromatic number of a graph G in time C((4/3 + 3^/^/4)") and space 
0{T). 

Proof: y{G) is itself a maximal x(G) -chromatic subset of G, so Lemma ^ shows that the algorithm 
correctly computes x{G) = X\y{G)]. Clearly, the space is bounded by 0(2"). It remains to analyze the 
algorithm's time complexity. 

First, we consider the time spent initializing X. Since we perform a 3-coloring algorithm on each subset 
of G, this time is 

^ 0(1.3289l^l) = r\ 1.3289') = 0(2.3289'). 

S<ZV{G) 1=0 Vv 

Finally, we bound the time in the main loop of the algorithm. We may possibly apply the algorithm of 
Theorem ^ to generate small independent subsets of each set G\S.\n the worst case, X[S\ =3 and we can 
only limit the size of the generated independent sets to |5|/3. We spend constant time adjusting the value of 
X[S U /] for each generated set. Thus, the total time can be bounded as 

^ 0(34¥-|G\^l4l^\^l-3^) = 0(£ W3T-"4"-2/) = o((l + —r). 
S(ZV{G) i=oVJ 3 4 

This final term dominates the overall time bound. □ 



5 Finding a Coloring 

Although the algorithm of the previous section finds the chromatic number of G, it is likely that an explicit 
coloring is desired, rather than just this number. The usual method of performing this sort of construction 
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task in a dynamic programming algorithm is to augment the dynamic programming array with back pointers 
indicating the origin of each value computed in the array, but since storing 2" chromatic numbers is likely 
to be the limiting factor in determining how large a graph this algorithm can be applied to, it is likely that 
also storing 2" set indices will severely reduce its applicability. 

Instead, we can simply search backwards from V{G) until we find a subset S that can be augmented by 
an independent set to form V{G), and that has chromatic number x{S) = x{G) — 1 as indicated by the value 
of X[S]. We assign the first color to G\S. Then, we continue searching for a similar subset T C S, etc., until 
we reach the empty set. Although not every set S may necessarily have X[S] = xi'^)^ it is guaranteed that 
for any 5 we can find T C S with S \ T independent and X[T] = X[S] — 1, so this search procedure always 
finds a correct coloring. 

Theorem 4 After computing the array X as in Theorem ^ we can compute an optimal coloring of G in 
additional time 0(2") and no additional space. 

We omit the details of the correctness proof and analysis. A pseudocode description of the coloring 
algorithm is shown in Figure |[ 

6 Conclusions 

We have shown a bound on the number of small independent sets in a graph, shown how to list all small 
independent sets in time proportional to our bound, and used this algorithm in a new dynamic programming 
algorithm for computing the chromatic number of a graph as well as an optimal coloring of the graph. 

Our bound on the number of small independent sets is tight for n/A < k < n/3, and an examination 
of the analysis of Theorem ^ shows that independent set sizes in this range are also the ones leading to the 
worst case time bound for our coloring algorithm. Nevertheless, it would be of interest to find tight bounds 
on the number of small independent sets for all ranges of k. It would also be of interest to find an algorithm 
for listing all small maximal independent sets in time proportional to the number of generated sets rather 
than simply proportional to the worst case bound on this number. 

Our worst case analysis of the chromatic number algorithm assumes that, every time we call the proce- 
dure for listing small maximal independent sets, this procedure achieves its worst case time bound. But is it 
really possible for all sets G \ 5 to be worst case instances for this procedure? If not, perhaps the analysis of 
our coloring algorithm can be improved. 

Can we prove a bound smaller than (") on the number of /-vertex maximal A;-chromatic induced sub- 
graphs of a graph G? If such a bound could be proven, even for = 3, we could likely improve the algorithm 
presented here by only looping through the independent subgraphs of G\S when S is maximal. 

An alternative possibility for improving the present algorithm would be to find an algorithm for testing 
whether x(G) < 4 in time o(1.415"). Then we could test the four-colorability of all subsets of G before 
applying the rest of our algorithm, and avoid looping over maximal independent subsets of G \ S unless 
X[S] > 4. This would produce tighter limits on the independent set sizes and therefore reduce the number of 
independent sets examined. However such a time bound would be significantly better than what is currently 
known for four-coloring algorithms. 
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